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Abstract 

Quorum sensing is a decentralized biological process, by which a com- 
munity of cells with no global awareness can coordinate their functional 
behaviors based on cell-medium interaction and local decision making. 
This paper draws inspirations from quorum sensing and colony competi- 
tion to study the clustering problem. 

We propose an algorithm treating each data as a single cell, utilizing 
the knowledge of local connectivity to cluster cells into multiple colonies si- 
multaneously. The algorithm consists of two stages: first, it spots sparsely 
distributed "core cells" and determines for each cell its influence radius; 
second, core cells secrete "auto-inducers" that diffuse into the environment 
to form colonies. Interactions between colonies eventually determine each 
cell's identity. We combine the two steps into a dynamic process, which 
gives the algorithm flexibility to analyze both static and time-varying 
data. 

Finally, we test our algorithm on several applications, including syn- 
thetic and real benchmarks datasets, alleles clustering, and dynamic sys- 
tems grouping and identification. The results suggest that our algorithm 
performs as well as other cutting-edge methods on static data, while ap- 
plications on time- varying data like locations of swarms of robots are also 
promising. 

1 Introduction 

Quorum Sensing [I] [2] [3] [4] is a decentralized biological process, by which a 
community of bacteria cells interact with environment locally with no awareness 
of global information. Each cell secretes auto-inducers that diffuse into local en- 
vironment and build up concentration. These auto-inducers may be captured by 
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receptors, which can activate transcription of certain genes equipped in the cell. 
In V. fisheri cells, the receptor is LuxR and the mechanism is shown in Fig[TJ 
When few cells exist in the neighborhood, diffusion kejpg the density of the in- 
ducers low, so that no functional behavior will be initiated. However, when the 
concentration reaches a certain threshold, a positive feedback loop is triggered 
to secrete more and more auto-inducers and fully activate the receptors. Spe- 
cific genes are transcribed, and relevant function or behavior expressed by the 
genes will be performed collectively. We find that cluster analysis in computer 
science closely resembles this quorum-searching phenomenon. So in this paper 
we associate these two parts with each other to develop a novel algorithm. 

Cluster analysis is to separate a set of unlabeled objects into clusters, so 
that objects in the same cluster are more similar. Currently, many cluster- 
ing algorithms have been studied, such as hierarchical clustering (CURE [5], 
BIRCH [6]), centroid-based clustering (K-means [7]), distribution based cluster- 
ing (expectation- maximization algorithm), density based clustering (DBSCAN [5]) 
and spectral clustering (Normalized Cuts [3], Power Iteration Clustering [TU]V 
However, current techniques suffer from several limitations: many algorithms 
require input of cluster number; sensitivity to outliers and noise often influences 
clustering results; some algorithms fail to adapt to clusters with different den- 
sity or arbitrary shape. However, these problems seem to be easily solved in 
nature by herds of animals, flocks of birds, schools of fish and colonies of cells, 
whose robustness and flexibility far exceed artificial algorithms. Consequently, 
it is plausible to learn from nature for new clustering algorithms. 

Biological insights can inspire new algorithms [11], like work of Yehuda et 
al. on borrowing ideas from biology to solve the maximal independent set prob- 
lem [12) . We think they can also inspire connections between dynamic system 
control and machine learning algorithms. First, a biologically inspired learning 
algorithm can be converted into a dynamical process, which suits for real-time 
control requirements. Second, biological processes handle well with failures 
and dangers which is similar with the robustness and stability requirements for 
learning algorithms and dynamic control. Third, biological processes are mostly 
distributed systems, which may inspire local decision making strategies to con- 
trol swarms of dynamic systems. In this case bridging dynamic system control 
with machine learning through biology inspired algorithms is promising. 

In this paper, we develop a clustering algorithm inspired by quorum sensing 
and colony competition. It is not only able to perform clustering on bench- 
mark datasets, but also easy to be integrated with dynamic systems and control 
strategies. With further extensions made possible through this integration, con- 
trol theory would be more intelligent and flexible. For the following parts of 
the paper, we describe details of our algorithm in the second section. Then, we 
test our clustering algorithm with several experiments. Lastly, we discuss about 
the results shown in the previous sections, and discuss about future works and 
extensions. 




Figure 1: Quorum sensing model 



2 Algorithm Model 

To design the algorithm, we model the biological process of quorum sensing and 
colony competitions, including the auto-inducer density model, the local tuning 
policy, the colony establishments, interactions between and inside colonies and 
colony splitting and merging processes. 

2.1 Gaussian Distributed Density Diffusion 

We treat every data as a single cell and use the Gaussian kernel function to 
describe secretion of auto-inducers as: 
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f(x,Xi) = e ^ (1) 

We use Gaussian distribution because it constrains the influence of any single cell 
in a local region. And the oi can be considered as "influence radius" representing 
the secretion ability of each cell. In such way, we can map all the data into a 
kernel matrix M nxn 
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mij = f(xi,xj) = e (2) 
As we know that, f(x,Xi) is cell i's influence over environment. So my is the 
influence from cell j to cell i. Moreover, d = M x l„ x i is the vector representing 
local density of all the cells, where 

n n llgj-xjll 2 

di = m v = e °** ( 3 ) 

If the density of a certain cell is high, we say this cell is "recognized" and 
located in a well-established colony. Also it will be sensible to set a threshold 
on rriij to make M sparse, since we only want the cells connected with their 
local neighbors. 



2.2 Local Decision for Influence Radius Tuning 

Cells tune their influence radius to connect with neighbors and maintain local 
density. We design the process to minimize a "hunger factor" , which is the error 
between the density vector and a goal vector as a = a ■ l„ x i, then a — d is the 
error or the "hunger factor" vector. The "hunger factor" information can be 
carried with the auto-inducers whose needed input is the local density that can 
be sensed locally. We use the quadratic error as the cost function: 

Vdensity = 1 Inxl — M ■ l n xl|| 2 

To minimize it, we take the time derivative of V. 
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We name the Jacobian matrix as J = {-§gd) 
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Then Ja = 2il "'^" e 
Proposition I. 

<j = J T (a -(?)(*) 
With this tuning policy for oVs, we can have 
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In more details, 
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Here, each term is composed of two parts: the (a — dj) term represents the 
"hunger factor" of surrounding cells, and the Jji part represents cell i's poten- 
tial to satisfy their needs. 

However, this proposition is easy to trigger "over-fitting" or get trapped in local 
minimums. Ill-posed results, such as "super cells" with infinite influence radius, 
may emerge. On the other hand, it is not feasible for distributed computation 
like swarms of robots since for any robot to make decisions, it needs complete 
information of all other agents. 



Proposition II. 

3 = J T (a - d) + /3(M - D)a - aa + f imt 



We regularize the equation in Proposition I. with terms concerning ||<r|| and 

n 

Y] rriijicrj — <Ji). For /3(M — D)a, we add the term based on the assumption 
that cells near each other have similar secreting ability. The D matrix is a 

n 

diagonal matrix, with the entries Da = ^ m^. So the ith term in the vector 

n 

P(M — D)a equals to ^ mij((Xj — cr,-), which provides diffusive bonding between 

cells' influence radius in a neighborhood. Also, we add inhibition term —aa, so 
that no "super cell" will emerge despite of existing needs, due to dissipation and 
its own capability, as it is in nature. The fi n u term provides initial actuation 
to expand the influence radius, which disappears after most of the cells have 
been connected with their neighbors. It can be regarded as an exploration stage 
searching local communities. Yet still, this proposition requires global informa- 
tion. 



Proposition III. 

ff = M(S- d) + fi(M - D)a - aa + f imt (6) 

We propose the local tuning policy, replacing J T with matrix M. Here M(a—d) 
is the local hunger factor vector accumulated at the location of each cell, which 
is the driving force tuning the a. When the local environment appears to be 
"hungry" (local hunger factor is positive), the cell tends to increase its influence 
radius to satisfy the demand, and vice versa. Our algorithm is more biologically 
reasonable in this case: the auto-inducers secreted by cells carry the demand 
information, and spread as the density distribution. 

Assume that influence radius in the same colony are mostly similar, then 
M rj M T ', thus all entries in J T only adds a proportional term — f 3 ^ before 
rriij. So intuitively, M should be a good approximation. 

Moreover, we use contraction analysis [T3] to prove the convergence of both 
Proposition II and III. Contraction analysis proposes that for the system x = 
f(x,t), if there exists a constant /3 > 0, such that for Vx,\/t > 

then all solutions converge exponentially to a single trajectory, independent of 
the initial conditions. More details with other complex metrics can be referred 
to [13j . For both propositions, we treat them as a = /((?) — eta, with the 
Jacobian matrix F = For Proposition II, after rescaling the data such that 
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Assume after setting a threshold for m^s each cell has less than 5a neighbors, 

then | Y^j Fiji — 15, so make a = 15, we can have g| = F — al as a negative 
diagonally dominant matrix, so that the system is contracting, and converging 
to a single equilibrium. 

For Proposition III, after rescaling the data such that V?,j, \\Xi — Xj\\ > a 2 
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Similarly with the less than 5a neighbors assumption, | . < 5, make a = 5, 
we have the system for Proposition III contracting, and converge to a single 
equilibrium. 

The convergence proof by contraction analysis is relatively conservative. In 
our simulations, we can get the system converging to a stable equilibrium with 
much smaller a choice. 



2.3 Colony Establishments and Interactions 

In quorum sensing, when the concentration surpasses a predefined threshold, 
cells begin to produce specific functional genes to perform group behavior. In 
our algorithm, we use this trait as the criterion for establishing a colony. When 
the density of a cell belonging to no colony, di surpasses a predefined threshold, 
we establish a new jth colony originating from it and add auxl colony vector 
Cj into the colony matrix C, where C — [c{, c%, Cj-i], with the only non-zero 
entry as 1 in the ith term, which is also Cij. 

In the Normalized Cuts algorithm [9 , which is a cutting edge spectral clustering 
algorithm widely used for image segmentation tasks, it is designed to minimize 
the cost function: 

Ncuts(A,B)= CUt{AB) ' CM{B > A) 



assoc(A, V) assoc(B, V) 
where 

cut{A,B)= 

ma 

and 

asso^A, V) — rriij 



We design the colony interactions to minimize a cost function similar to the 
Normalized Cuts: 

V co i on y = ^cf(M + M T )^-^cf(M + M T )3 



= ^<? l T (M + M T )(c e -( 7 + l)c,) 



Here, c e = Yli Si Consequently, by making 

Si = -(M + M T )c e + (7 + 1)(M + M T )c, 

V^Zcny - -^(c e -( 7 + l)Q) T (M + Af T ) 2 (c e -(7 + l)& i ) 

i 

< 

We can also transform the interaction equations into a matrix form, where 
C e is a matrix with every column same as c e : 

C = -{M + M T )C e + (7 + 1)(M + A/ T )C* 

Summing all colony vectors to achieve the environmental colony vector c e 
also follows the idea of quorum sensing to simplify the calculation through using 
global variable updates instead of calculating every component. Here all entries 
in C are saturated in the range of [0, 1]. The interactions between colonies are 
composed of two parts, one part is the mutual inhibitions between colonies, and 
the other is self- promotion of colonies into the environment. When initially the 
colonies have not yet been well developed, there is only colony self-expanding, 
like a neighbor-to-neighbor infection. After the colonies have been expanding 
for a while, some colonies become neighboring to each other, and the mutual in- 
hibition comes into effect. So finally it will be a balance between self-expanding 
and inhibitions from others. 

Furthermore, we can explain the interaction in a micro view at the bound- 
ary of two competing colonies as shown in Figj2j suppose that we have two 
colonies A and B neighboring with each other, with colony vector as ca and cb 
respectively. For a single cell i in the boundary area between the two colonics, 
following the interaction rules, 



= ~y^(m^- + mji)c B j +J*^2{mi3 + m^CAj 



c B i = - y^X m v + m oi) c Aj + 7 y^(mjj + rnji)c B j 
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When 7 = 1, it is obvious that CAi — —CBi, so if accumulated influence from 
colony A is larger than from colony B, as J2j( m ij + m ji)cAj > J2j( m ij + 
rriji)cBj, then finally cm — 1, cs^ = 0, vice versa. Eventually each row in C will 
have at most one non-zero entry as 1, on the column, whose colony has most 
accumulated influence towards the cell. 

7 is the parameter measuring the relative strength of inhibition and expanding 
forces: when 7 < 1, the inhibition force is enhanced, there might exist some 
blank boundaries between colonies. While when 7 > 1, it is easier for the colony 
factors to spread into neighboring colonies. So at the beginning, it would be wise 
to tune up 7, to speed up newborn colonies growing, and enhance distributed 
small colonies merging. Later when colonies have become stable, we can tune 7 
back to 1 to achieve a distinct clustering result. 




Figure 2: The interactions between two colonics 



2.4 Colony Merging and Splitting 

Among the established colonies, some may be well connected to each other, while 
also there may be new colonies. Such scenarios require rules for merging and 
splitting colony parts. We calculate the ratio between inter colony connections 
and intra colony connections as a criterion measuring the merging possibility 
for one colony into another: 

cf(M + Af T )cj 
rij cf(M + MT)ci 

We set a threshold, such that if is large enough, colony i is merged into colony 
j. On the other hand, there may be occasions that new clusters are split from 
previous colonies. So we set a continuity detecting vector si for each colony: the 
evolution of si follows the same rules of colony interactions. When the detecting 
process reaches a stable equilibrium, we restart it all over again. Cells identified 
as outliers in each iteration are marked as "no longer recognized" and become 
available for forming new colonies upon again. 



(7) 



2.5 Clustering Result 



Finally, we get the result by choosing the maximal entry of each row in matrix 
C, that determines the colony identity of each cell. And cells with null rows are 
regarded as outliers. 

Pseudo Code of the proposed algorithm is presented below: 



1. Initialize a as 0, form the M matrix, set the parameters a, b, /?, 7 

2. Begin the_process: 

ff = M{a-d)+ (3(M - D)S - aa + f mlt 
Detect new cluster: 

if 3di > b(b < a) and cell i not recognized by any colony 

create a new colony using cell i as core cell 

end 

C = — (M + M T )(C e -C)+ 7 (M + M T )C 
S = -(M + M T )(S e -S) + 7 (M + M T )S 
Cluster segmented detection: 

if in the stable state, S 7^ C 

update C = S and accept new born clusters 

end 

_ cf (M+M T )c j 
Vl 3 ~ cf(M+M T )ci 

Cluster merging: 
if 3r y > 0.2, i ^ j 

then we can merge the colony i into colony j 
end 

3. Achieve the clustering results by counting the C matrix 



For the parameters choices: 7 defines the ability of mutual penetration and 
crossing density gaps. So with larger 7, colonies are more inclined to merge. 
(3 measures similarities of oVs in local neighborhood. Larger j3 would result 
in smoother oVs distribution, yet potentially rugged density distribution, a 
measures sparsity of the connection graph. With a more connected graph, we 
tend to have fewer clusters, and vice versa. Hence, we have some basic rules to 
tune the parameters: if the result suggests fewer clusters than we expect, we can 
tune down a and 7, and if the influence radius of some cells become too large, 
we can tune up /3 and a. In the future, we can design rules tuning parameters 
dynamically according to our expectations. 

3 Experiments 

We test our algorithm on several applications, including synthetic and real 
benchmarks datasets, alleles classifications, and dynamic systems grouping and 
identification. 



3.1 Synthetic Benchmarks Experiments 



We provide the clustering results on four difficult synthetic occasions that are 
nonlinearly separable and follow no specific distribution: the two-chains model, 
the double-spirals model, the two moons model and the island model. The 
results are shown in Fig. [3] proving that influence radius tuning ensures that 
the density distribution closely fits the topology of data, which provides distinct 
separation boundaries. 

Island model Two spiral Two moon Two chain 



Original 




Result 



Figure 3: Clustering results on synthetic benchmarks 



3.2 Real Benchmarks Experiments 
Iris flower dataset 

The Iris flower dataset [H], introduced by Sir Ronald Fisher, consists of 150 
instances forming 3 clusters, of which two are only nonlinearly separable. 
Pendigits dataset 

The Pendigits datasets [T5] is a multivariate dataset of 10992 instances, each 
with 16 attributes. We randomly choose 1000 instances for clustering. Also, 
we build two subsets of the dataset to test: PenDigitsOl (easier) and PenDig- 
itsl7(harder), with digits "0", "1" and "1", "7", respectively. 
Polbooks dataset 

PolBooks [TB] is a co-purchase network of 105 political books. Each book is 
labeled "liberal" , "conservative" , or "neutral" , mostly in the first two category. 
Here we compare our results with cutting-edge algorithms including Normalized 
Cuts [9], Ng- Jordan- Weiss algorithm [17] and Power Iteration Clustering [TP] , 
shown in Table [I] For the Iris dataset, our performance is comparable to the 



cutting edge methods. For the Pen-digits data, we can cluster 10 classes simul- 
taneously with overall correctness rate 86.6% while other methods don't have 
such ability. Moreover, for the two subcases, we outperform the comparisons. 
For the network segmentation task of Polbooks dataset, although our algorithm 
is not designed specifically to solve such tasks, the segmentation result is still 
very satisfying. 



Table 1: Clustering result comparison with NCut, NJW and PIC 



Dataset 


Instances 


Clusters 


NCut 

(%) 


NJW 

(%) 


PIC 

(%) 


Ours 

(%) 


Iris 


150 


3 


67.3 


80.7 


98.0 


97.3 


Pendigits 


1000 


10 








86.6 


PenDigitsOl 


200 


2 


100.0 


100.0 


100.0 


100.0 


PenDigitsl7 


200 


2 


75.5 


75.5 


75.5 


81.5 


Polbooks 


105 


3 


84.8 


82.3 


86.7 


83.8 



3.3 Novel Experiment on Application for Alleles Cluster- 
ing 

As introduced in |18j . it is very important to understand the similarities of 
DRB (HLA-DR chain) alleles for the designation of high population coverage 
vaccines through classifying the supertypes of them. In the work of [18], Wen- 
Jun et al. analyzed on 559 DRB alleles, and proposed a kernel matrix based on 
BLOSUM62 matrix measuring the distance between the alleles as 

D L *(a,b) = (^^(4(a,c)-4(t lC )) 2 )5 

So we utilize proposed distance in our Gaussian kernel, with the same tun- 
ing policy. The clustering result compared to |18j is shown in Table [2j Our 
result has no misclassification errors. The only difference is that the 25 al- 
leles classified as outliers have been classified into some clusters in [TH] using 
hierarchical clustering methods. Outliers such as DRB5*0112, DRB1*1525, 
DRB1*1425, DRB1*1442, DRB1*1469 and DRB1*0832 are discussed as excep- 
tions in their result, which makes their classification more doubtful. Also we 
share same conclusions on exceptions like DRB1*0338, DRB3*0115; and like- 
wise classify DRB1*1440 , DRB1*1469, DRB1*1477, DRB1*1484, DRB1*14116 
and DRB1*14102 into the ST8 supertype. 

This experiment proves that our algorithm is effective on clustering multiple 
clusters simultaneously for alleles clustering data, and also our results support 
the conclusions of Wen-jun et al.'s work on the mathematical foundation anal- 
ysis of amino acid chains. The detected outliers may lead further analysis and 
provide potential directions to biological researchers. 



Table 2: Clustering result comparison of alleles clustering 



Supertype 


Number of Alleles 


Misclassificd 


Outliers 


ST52 


43 








ST3 


63 





6 


ST6 


100 





2 


ST8 


52 





2 


ST4 


93 





6 


ST2 


68 





1 


ST5 


34 





1 


ST53 


6 








ST9 


16 





1 


ST7 


18 





1 


ST51 


15 








STl 


34 





2 


STIO 


3 





3 


Overall 


25 





25 



3.4 Experiments on Dynamic System Grouping 

In this section, we introduce applications on clustering dynamic systems, which 
show our capability to deal with time varying data in a continuous way. The 
clustering results that we achieve are flexible and changing according to the 
distribution of data, which can be regarded as an integral over time. 

3.4.1 Application I. Real-time clustering of mobile robots 

In natural colonies like fish schools or hordes of buffalos, groups merging and 
splitting are so smooth and elegant, which gives the colonies flexibility to deal 
with obstacles and dangers. Our first application is to cluster time- varying data 
such as locations of working mobile robots imitating the phenomenon. Also 
as explained in [19j . synchronization will enhance the ability to resist noise and 
improve robustness of the whole network. So potentially, we can couple agents in 
the same group, to achieve synchronization by forming contracting Lagrangian 
systems as introduced in |20j and [21] . Such dynamic grouping and coupling 
would help enhance control performance, which can be analyzed in the future. 

In our simulation, 200 robots are located as the previously introduced two- 
moon dataset, moving around their center locations with radius of 0.5 and 
random speed. During that time, 30 new robots join the group, and also some 
robots migrate to form new clusters. We update the distance matrix D and in- 
fluence matrix M continually. For potential real world application, we can use 
electromagnetic field emitters, and electromagnetic field intensity sensors to ac- 
tualize the designed mechanism, replacing the agent-to-agent communications 
with agent-environment-agent interactions, which is the core idea of quorum 
sensing. 



From videqj and Fig|5j we can see that the cluster number is first merged 
down to 3 clusters, and then varies with the merging and splitting events, ex- 
actly describing the real-time variation of the robots migration. And shown 
in Fig|4j the influence radius tuning is capable of handling local density vari- 
ations: it is tuned down responding to a local high density, and vice versa, to 
preserve balance. The results prove our capability of dealing with time varying 
data by using the accumulated information to do dynamic clustering, and han- 
dling variations of cluster numbers as well. The results also suggest that using 
our algorithm to group dynamic systems is feasible and further applications on 
synchronization and coordination are promising. 
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Figure 4: Variation of density and influence radius of a single cell 
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Figure 5: Cluster numbers of over the simulation time 



3.4.2 Application II. Multi-model switching 

As introduced in |22j |23j . multi-model switching control could improve the 
transient control performance and precision. Here we propose a new method 
for multi-model switching control: 

x http: / /www.youtube.com/watch?v=EshxTGNpQC4 



Suppose we have a system with unknown parameters, however we know that 
there arc limited possible choices of parameters configuration. 

1. Initially, we use adaptive control to assure acceptable performance. Si- 
multaneously, we have tens of virtual systems simulating with the same 
control input whose parameters scatter around the pre-known choices. 

2. When the density of virtual system is stable after tuning, we calculate 
local density of the real system: 

" _ll/r-/il| 2 

i 

where f r and fi are Fourier transform vectors of the input of the real 
system and virtual systems. 

3. If d r exceeds a predefined threshold, we know the real system belongs to a 
virtual cluster. Hence we can get the real parameters and switch to robust 
control. 

4. Further, if the parameters vary again, by detecting d r dropping down, we 
can resume adaptive control and wait for the next time that d r surpasses 
the threshold. 

For experiment, we use 60 virtual dynamic systems as rriiXi + bi\xi\xi + kiXi = 
Ui^rrii,bi,ki are unknown constants. The parameters scattered 
around three known choices: [4, 3, 2], [2, 4, 3] and [3, 2, 4], with 20 systems each. 
And we have a "real" system, whose parameters m r ,b r ,k r are set as [4,3,2] 
initially, and then changed to [2,4,3]. Assume the task is to track the trajectory 
of Xd(t). According to adaptive control theory, we need control law: 

u = m(xd — X(x — id)) + b\x\x + kx — k±s (9) 

where s = x + Ax, k\ > is a constant, and adaptive law: 

rh = -s(x d - \(x - x d )) 

b = — s\x\x 
k = —sx 

Since we know rh(x d — X(x — x d )) + b\x\x + kx converges to asymptotically 
and thus u w m(x d — \(x — Xd)) + b\x\x + kx, systems with similar parameters 
require similar inputs. So that we can use the input Fourier transform vector 
to measure distance between systems in our algorithm. 

As we can see from the simulation results Figj6] and FigjTJ soon after the 
multi-model switching starts at t = 10 seconds, the density of the real system 
surpasses the threshold 5, and the parameters are estimated correctly. After the 
parameters change at t = 20 seconds, sharply the density drops along with the 
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Figure 6: Parameter estimations of the real system 




Figure 7: Density of the real system 



control mode switched to adaptive control. After about another 10 seconds, the 
density is high again, and the system is correctly estimated with new parameters. 

With the applications above, we show the potential of combining our algo- 
rithm with dynamic systems. The algorithm attempts to imitate the smooth 
grouping and coordination of natural colonies and the results prove the reliabil- 
ity of proposed algorithm. 

4 CONCLUSIONS 

This paper presents our dynamic clustering algorithm inspired by quorum sens- 
ing as a potential bridge between machine learning and control. Experiments 
on different types of unlabeled datasets show that our algorithm performs as 
well, if not better, as some cutting edge clustering methods on static datasets 
and performs well on dynamic clustering tasks. Our algorithm's advantage over 
existing algorithms can be concluded as: 

1. Since the influence radius is tuned to preserve local connectivity, we can 
adapt to clusters with different sizes and variations. We can cluster non- 
linearly separable data that follow no specific distribution, and handles 
noise and outliers well. 



2. Our decentralized algorithm is potentially suitable for parallel and dis- 



tributcd computation. Potential applications on real mobile robots would 
replace agent-agent communications with agent-environment interactions. 

3. We can cluster multiple colonies simultaneously. And the segments are 
dynamically adjusted through smooth merging and splitting. 

4. The algorithm is dynamic, which is easy to be combined with real me- 
chanical systems. Novel control that is more flexible and robust theory 
for coordinating complex systems may emerge. 

The overall computation complexity would be O(n) 3 with single processor. How- 
ever, if we use the algorithm on real robots clustering, with distributed com- 
putation, the computation of single robot would be hugely reduced to linear 
time. 

For future work, first, we will conduct further research developing rules dy- 
namically tuning the parameters. Second, for dynamic systems' metrics, we may 
need various better methods for extracting feature vectors rather than just an- 
alyzing the fast Fourier transform vector of the inputs. Third, we will look into 
the possibilities of using our clustering algorithm on much more applications 
involved with dynamic systems. And last but not least, we will develop new 
control theories that utilize the information gained from our novel algorithm, 
to further improve control performance and synchronization with more natural 
self-organizing coordination. 
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